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Monoallelic expression of ICF2 is regulated by 
CCCTC binding factor (CTCF) binding to the 
imprinting control region (ICR) on the maternal 
allele, with subsequent formation of an intrachromosomal 
loop to the promoter region. The N-terminal domain of 
CTCF interacts with SUZ1 2, part of the polycomb repres- 
sive complex-2 (PRC2), to silence the maternal allele. We 
synthesized decoy CTCF proteins, fusing the CTCF de- 
oxyribonucleic acid-binding zinc finger domain to CpG 
methyltransferase Sssl or to enhanced green fluorescent 
protein. In normal human fibroblasts and breast cancer 



MCF7 cell lines, the CTCF decoy proteins bound to the 
unmethylated ICR and to the IGF2 promoter region but 
did not interact with SUZ1 2. EZH2, another part of PRC2, 
was unable to methylate histone H3-K27 in the IGF2 pro- 
moter region, resulting in reactivation of the imprinted 
allele. The intrachromosomal loop between the maternal 
ICR and the IGF2 promoters was not observed when IGF2 
imprinting was lost. CTCF epigenetically governs allelic 
gene expression of IGF2 by orchestrating chromatin loop 
structures involving PRC2. 



Introduction 

The transcriptional regulator CCCTC binding factor (CTCF) 
controls the expression of several genes via chromatin insula- 
tion or enhancer blocking (Bell et al., 2001; Ohlsson et al., 
2001; Dunn and Davie, 2003; Recillas-Targa et al., 2006). 
Through the use of different combinations of its 1 1 highly con- 
served zinc fingers (ZFs; Mukhopadhyay et al., 2004), CTCF 
binds to sites within promoters, silencers, and insulators of 
genes involved in growth, proliferation, differentiation, apop- 
tosis, imprinting, and X chromosome inactivation (Filippova, 
2008; Donohoe et al., 2009; Ohlsson et al., 2010). More than 
15,000 CTCF binding sites have been identified throughout 
the genome (Kim et al., 2007). In addition, CTCF has recently 
been shown to act as a tethering protein, serving as a molecu- 
lar glue to secure long-range intrachromosomal (Kurukuti 
et al., 2006; Li et al., 2008) and interchromosomal (Ling et al., 
2006) interactions. 
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CTCF was initially shown to serve as an insulator in the 
chicken p-globin locus (Bell et al., 1999). The imprinting con- 
trol region (ICR) of the two coordinately imprinted genes IGF2 
and HI 9 was also identified as another CTCF target. The ICR, 
located in the 5' flanking region of the H19 gene and 90 kb 
downstream of the lgf2 gene, is maternally unmethylated and 
paternally methylated (Mann et al., 2000; Reik et al., 2000; 
Sasaki et al., 2000; Arney, 2003; Engel and Bartolomei, 2003; 
Murrell et al., 2004) The binding of CTCF to the unmethylated 
maternal ICR creates a physical boundary, blocking the inter- 
action of downstream enhancers with the remote 1GF2 promoters 
and silencing the maternal allele (Bell and Felsenfeld, 2000; 
Hark et al., 2000; Kanduri et al., 2000). 

Recent studies have demonstrated that CTCF regulates 
allelic expression of mouse Igf2 by forming a long-range intra- 
chromosomal loop (Murrell et al., 2004; Li et al., 2008). This 
complex intrachromosomal loop is established on the maternal 
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chromosome between the ICR and the Igf2 promoters, presum- 
ably through the formation of CTCF-CTCF dimers or multimers 
(Yusufzai et al., 2004; Li et al., 2008). CTCF also binds to com- 
ponents of the polycomb repressive complex-2 (PRC2), leading 
to silencing of the maternal allele through histone K27 methyla- 
tion (Li et al., 2008), thereby allowing the exclusive expression 
of H19 from the maternal allele and IGF2 from the paternal allele 
(Mann et al, 2000; Wolffe, 2000; West et al., 2002; Bartolomei, 
2003; Engel and Bartolomei, 2003). Biallelic expression of 
IGF2 may be seen in many cancer cells, although the molecular 
basis for this loss of imprinting is poorly understood. 

In this work, we extend our examination of the role of CTCF 
in orchestrating long-distance intrachromosomal looping in the 
human IGF2/H19 imprinting domain. We synthesized CTCF 
decoy proteins that contain the protein's evolutionarily conserved 
DNA binding domain to decipher the mechanisms through 
which CTCF controls allelic regulation of human IGF2. 

Results 

CTCF interacts with SUZ1S through its 
N-terminal CIMTJ domain 

CTCF recruits the PRC2 complex to induce allelic silencing 
through histone K27 methylation by interacting with SUZ12. 
We decided to characterize the CTCF domain that interacts with 
SUZ12. CTCF can be divided into three functional regions: the 
NT, DNA-binding ZF, and C-terminal (CT) domains. We con- 
structed each domain in a GST vector and purified recombinant 
proteins using a GST column (Fig. 1 A). Recombinant proteins 
composed of each domain were incubated with recombinant 
SUZ12. After pulldown, the GST-protein complexes were sep- 
arated on a PAGE gel and examined for their interaction with 
SUZ12 using Western blot analysis. As expected, SUZ12 was 
detected in the reaction with full-length CTCF (Fig. 1 B, lane 5), 
confirming the interaction between these two functional pro- 
teins as previously reported (Li et al., 2008). SUZ12 bound to 
the CTCF-NT domain (Fig. 1 B, lane 4), but neither the ZF do- 
main, which is required for DNA recognition and binding, nor 
the CT domain interacted with SUZ12. 

To study the role of the SUZ12 interaction in the mainte- 
nance of imprinting, we synthesized a decoy CTCF that main- 
tains the ability to bind target DNA sequences but is unable to 
interact with SUZ12. For this purpose, the ZF domain was 
amplified by PCR and was linked in frame to Sssl (CpG methyl- 
transferase and ZF-Sssl) or EGFP (tracking gene and ZF-EGFP). 
The addition of Sssl or EGFP allowed us to synthesize decoy 
CTCF proteins with a similar molecular mass as that of the full- 
length CTCF (83-kD native CTCF, 87-kD full-length ZF-Sssl, 
and 73-kD ZF-EGFP). Sssl is a DNA methyltransferase that is 
able to methylate CpG dinucleotides in some DNA sequences 
near the region where it binds, and EGFP is useful in tracking 
the transfected cells as a marker. To examine whether the decoy 
CTCFs interact with SUZ12, the purified recombinant decoy 
proteins or wild-type CTCF (Fig. 1 C) was incubated with cell 
extracts, pulled down using Ni-iminodiacetic acid columns, 
and examined for SUZ12 binding by Western blotting. As ex- 
pected, SUZ12 was detected in the reaction with full-length 



CTCF (Fig. 1 D, top panel, lane 2), but neither ZF-Sssl nor 
ZF-EGFP interacted with SUZ12 (Fig. 1 D, top panel, lanes 3 
and 4). As controls, full-length CTCF was detected by an anti- 
body that specifically recognizes the NT portion of CTCF 
(Fig. 1 D, second panel, lane 2), and decoy ZF-EGFP was de- 
tected by a GFP antibody (Fig. 1 D, third panel, lane 4). 

CTCF decays compete with endogenous 
CTCF in binding to the ICR of the 
IGFS/H19 domain 

The CTCF-Sssl and CTCF-EGFP genes were packaged into 
lentiviruses and transduced into human MCF7 (breast cancer), 
HBF1, and WSF7 (normal fetal skin fibroblasts) cells that main- 
tain normal IGF2 imprinting, i.e., the exclusive expression from 
the paternal allele. We used two control cell lines, one with a 
mock virus carrying the empty vector and one without the virus. 
The expression of CTCF-Sssl and CTCF-EGFP was measured 
by RT-PCR (Fig. 2, A and B) in stable cell clones selected with 
puromycin. Using EGFP as a tracking marker, we also observed 
the green fluorescence of the expressed ZF-EGFP fusion pro- 
tein in MCF7 and WSF7 cells (Fig. 2 C). Using an anti-EGFP 
antibody, we demonstrated the expression of the predicted fusion 
protein (Fig. 2 D, lanes 3 and 6), which has a larger molecular 
weight than the native EGFP control (Fig. 2 D, lanes 2 and 5). 

We then used a chromatin immunoprecipitation (ChIP) 
assay to detect the binding of the fusion proteins to the IGF2/ 
HI 9 ICR. Because there are no available antibodies to Sssl, we 
took an approach in which two antibodies were used to immuno- 
precipitate different domains of CTCF in ZF-Sssl-transfected 
cells. The first anti-CTCF antibody that we used specifically 
recognizes the CT domain of CTCF, which was deleted in the 
decoy CTCFs; thus, this antibody only detects the binding of 
the endogenous (native) CTCF. As expected, the ChIP PCR 
showed that the endogenous CTCF binds to CTCF sites 1 and 3 
in human fibroblast HBF1, WSF7, and breast cancer MCF7 cells, 
in which IGF2 imprinting is maintained (Fig. 2, E and F, lanes 
7 and 8). However, in decoy CTCF-expressing cell clones, no 
wild-type CTCF binding was detected using an antibody that 
recognizes the CT domain of CTCF (Fig. 2, E and F, lanes 3 
and 4), suggesting that the overexpressed ZF-Sssl decoy out- 
competes the endogenous CTCF for binding at the ICR. 

The second antibody specifically recognizes the CTCF ZF 
domain that is present in both the fusion protein and the endoge- 
nous CTCF. ChIP with this antibody showed that the ZF-Sssl 
binds to these two CTCF sites (Fig. 2, E and F, lanes 1 and 2). 

We also used these two antibodies to examine the CTCF 
binding to the sixth CTCF site in the ICR region. Again, the data 
support the finding that the viral overexpressed ZF-Sssl over- 
rides the endogenous CTCF in binding CTCF site 6 in both 
MCF7 and HBF1 cells (Fig. 2 G, lanes 1, 2, 5, and 6). 

In addition, we also confirmed the direct competition be- 
tween the ZF-EGFP decoy and the endogenous CTCF by using 
an anti-EGFP antibody and an antibody against the CT portion 
of CTCF. As expected, no GFP binding to the ICR sixth CTCF 
site was detected in normal MCF7 and WSF7 cells (Fig. 2 H, 
middle panel, lanes 1 and 2). However, in cell clones expressing 
ZF-EGFP, the decoy directly binds to the ICR in both human 
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Figure 1 . In vitro binding of recombinant proteins with SUZ12. (A) A schematic diagram of GST fusion constructs. CTCF-FL, full-length CTCF; NT, N-terminal; ZF, 
zinc finger domain; CT, C-terminal. (B) Binding of CTCF domains with SUZ12. Input: reaction aliquots collected before the pull-down reaction and ana- 
lyzed in parallel with the samples using Western blotting with an anti-human SUZ12 antibody. (C) A schematic diagram of His-tagged fusion constructs. 
CTCF-HIS, His-tagged full-length CTCF; ZF-Sssl-HIS, His-tagged ZF-Sssl ; ZF-EGFP-HIS, His-tagged ZF-EGFP. (D) A lack of interaction between His-tagged CTCF 
decoy proteins and SUZ1 2. The purified decoy proteins were incubated with cell extracts, and SUZ1 2 interaction was detected by Western blotting. 



fibroblast WSF7 and breast cancer MCF7 cells, as no binding of the 
native CTCF was detected (Fig. 2 H, lanes 3 and 4). Collectively, 
these data indicate that the overexpressed decoy proteins out- 
compete endogenous CTCF in binding to the 1GF2/H19 ICR site. 

CTCF binding near the IC3FS promoters is 
abolished by decoy CTCF proteins 

We then determined where CTCF bound in the IGF2 promoter 
region using ChlP. After immunoprecipitation, IGF2 promot- 
ers in the anti-CTCF-precipitated DNA were examined by PCR 
(Fig. 3 A). In control cells, wild-type CTCF binds to at least three 
sites in the promoter region (Fig. 3 A, d-f sites, which are near 
each of the imprinted promoters; and Fig. 3 B, top panel, lanes 
10-12), but not to the a site further upstream or the c site between 
promoter 1 and 2 (Fig. 3 B, top panel, lanes 7 and 9). In the CTCF- 
Sss 1 gene-transfected MCF7 clones, very weak or nonendogenous 
CTCF binding could be seen in these regions (Fig. 3 B, lanes 2-6). 
We also detected very weak or non-wild-type CTCF binding near 
the IGF2 promoters in CTCF-Sssl-transfected fibroblast HBF1 
cells (Fig. 3 C, lanes 2-6). The far upstream a site was included as 
a control site where Suzl2 binds independently without the partici- 
pation of CTCF (Fig. 3, B and C, lanes 1 and 7). 

SUZ12 does not directly bind DNA, but it associates with 
DNA by interacting with the CTCF in some cell lines (Fig. 1; 
Li et al., 2008). We examined the recruitment of SUZ12 to the 
IGF2 promoters by ChlP analysis and found that SUZ12 inter- 
acted in the four IGF2 promoter regions (Fig. 3 B, second 
panel, lanes 8 and 10-12). A positive site (a) further upstream 
was also included, where SUZ12 binding is CTCF independent. 



In CTCF-Sssl-expressing cells, however, there was no SUZ12 
binding at the IGF2 promoter region (Fig. 3 B, lanes 2-6), and 
SUZ12 binding persisted at the control site, a (Fig. 3 B, lanes 1 
and 7). These data show that the CTCF-Sssl protein, which 
lacks the CTCF NT domain, was unable to bind SUZ12. 

Loss of H3-KS7 methylation at the 
/GF2 promoters 

The CTCF-SUZ12 chromosomal interaction is required for 
specific monoallelic methylation of histone 3 lysine 27 (K27) in 
IGF2 promoters (Li et al., 2008). We examined whether the loss 
of the SUZ12 interaction would affect H3K27 methylation in 
the IGF2 promoter region. In control MCF7 cells, we observed 
H3K27 methylation at the IGF2 promoters (Fig. 3 B, third panel, 
lanes 10-12). In contrast, no H3K27 methylation was observed 
in this region in cells transfected with a CTCF-Sssl gene (Fig. 3 B, 
lanes 2-6). We also demonstrated that the CTCF-Sssl protein 
abolished the SUZ12 interaction and H3K27 methylation in the 
IGF2 promoters in transduced normal human skin fibroblast 
HBF1 cells (Fig. 3 C). 

Interruption of CTCF-mediated 
intrachromosomal and local looping 

We then examined CTCF-mediated intrachromosomal loop- 
ing using the chromatin conformation capture technique (3C; 
Dekker et al., 2002; Li et al., 2008). Cloned cells were fixed with 
1% formaldehyde, digested with restriction enzyme EcoRl, and 
then ligated with T4 DNA ligase. In cells in which imprinting 
is preserved, the IGF2 promoters directly interact with the 
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Figure 2. Overexpression of the decoy CTCF abolishes the IGF2/H19 ICR binding of the endogenous CTCF. The expression of decoy CTCF was measured 
by RT-PCR (A and B), fluorescence microscopy (C), and Western blotting (D). Binding of endogenous CTCF and decoy CTCFs to the ICR was detected by 
ChIP using two antibodies that recognize the CT and ZF regions of CTCF, respectively (E and F, CTCF sites l and 3; and G, CTCF site 6). The competitive 
binding of native CTCF and ZF-EGFP to the sixth site was detected by a GFP antibody that specifically recognizes the ZF-EGFP and an antibody against 
the CTCF CT domain (H). Bars, 400 pm. 
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diagram of the IGF2/H19 imprinting domain. The 
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ICR that is located 80 kb downstream (Fig. 4 A; Li et al., 2008; 
Vu et al., 2010). Control MCF7 breast cancer cells maintained 
the intrachromosomal interaction between the IGF2 promoters 
and the ICR (Fig. 4 B, lanes 3 and 4). In MCF7 cells that were 
transfected with the CTCF-Sssl decoy gene, however, no intra- 
chromosomal interaction was detected between the ICR and the 
IGF2 promoters (Fig. 4 B, lanes 1 and 2). The control T48/T49 
DNA was detected in all samples. The CTCF-Sssl decoy pro- 
tein also interrupted the IGF2 promoter-ICR intrachromosomal 
interaction in human skin fibroblasts (Fig. 4 C). 

Interestingly, we also noticed changes in the local chro- 
matin structure around the IGF2 promoters after the expression 
of the CTCF-Sssl protein. In control cells, interactions between 
promoter P2 (lb) and promoter P4 (3b), as well as among 4b, 
5b, and 6b (differentially methylated region-2 [DMR2]), were 



observed (Fig. 5 A, lanes 3 and 4). In CTCF-Sssl-expressing 
cells, however, some local chromatin interactions were lost, 
whereas other new interactions appeared (Fig. 5 A, lanes 1 and 2). 

To identify which allele is involved in those lost and new 
local interactions, we sought two single nucleotide polymorphism 
sites to distinguish between the parental alleles by endonucleases 
Bfal and Nallll (Fig. 5, B and C, top). In control cells, the lb-3b 
interaction between P2 and P4 was biallelic (Fig. 5 B, lanes 7 and 8). 
In cells expressing the ZF-Sssl, the new interaction between P2 
and P3 was also biallelic (Fig. 5 B, lanes 1 and 2). However, in the 
DMR2 region, the lost and new interactions were monoallelic 
(Fig. 5 C, lanes 3-6). These data suggest that CTCF may also be 
involved in maintaining local chromatin structure around the 
IGF2 imprinting locus, although these interactions may not di- 
rectly participate in the regulation of allelic expression. 



Decoy CTCF abolishes IGF2 imprinting • Zhang et al. 479 



IGF2 



hP1 

5m- 



EcoR l-3C-sites 



PCR Primer 



hP2 hP3 hP4 



H19 



I I I 

12 3 
1b 2b 3b 
T48 T49 



HSS 

■ 80kb 


CTCF r 


Enham 


I 

6 


I 

7 


8 





4b 5b 6b 



7b 8b 



B 



150 bp 
100 bp 

150 bp 



100 bp 
150 bp 

100 bp 



MCF-7 



ZF-Sss1 Control 



— ■*— 1 b/7b ( 93 bp ) 



2b/8b ( 119 bp ) 



HBF-1 



ZF-Sss1 Control 



— 4— T48/T49 (105 bp ) 



150 bp- 
100 bp 

150 bp 
100 bp 

150 bp 
100 bp 



«— 2b/7b( 115 bp ) 



3b+8b ( 82 bp ) 



— «— T48/T49 (105 bp ) 



Figure 4. Intrachromosomal loop between ICF2 promoters and the ICR. (A) A schematic diagram of the EcoRI sites in the IGF2/H 1 9 imprinting locus used 
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Aberrant: imprinting of ItSFS by 
decoy CTCF 

We then examined whether allelic expression of IGF2 was af- 
fected in MCF7 and HBF1 cells transfected with CTCF-Sssl. 
We detected monoallelic expression of IGF2 in control and 
mock-treated MCF7 cells (Fig. 6 A, lanes 9-12). In the cells 
transfected with CTCF-Sssl, IGF2 became biallelically ex- 
pressed (Fig. 6 A, lanes 1-6). Similarly, the decoy protein 
also abolished IGF2 imprinting in human skin fibroblast cells 
(Fig. 6 B, lanes 1-6). To confirm the observations, we trans- 
duced cells with another CTCF analogue that fuses the CTCF 
ZF domain with EGFP. This CTCF-EGFP decoy protein also 
induced biallelic expression of IGF2 in both MCF7 cells (Fig. 6 C, 
lanes 1-3). Similar data were also seen in a second human fetal 
fibroblast cell line, WSF7 (Fig. 6 D, lanes 1-3). 

Genomic imprinting of H19 

In mice, the monoallelic expression of Igf2 is closely coordi- 
nated in a reciprocal fashion with H19 imprinting through 
CTCF-ICR insulation (Bartolomei et al., 1993; Bell and 
Felsenfeld, 2000; Hark et al., 2000). The CTCF insulator marks 
the boundary in the ICR that is differentially methylated on the 
two parental alleles. CTCF binds to the unmethylated maternal 
CTCF DMR and insulates the Igf2 promoter from the remote 
enhancer downstream of HI 9. However, allelic methylation of 
the paternal ICR prevents the binding of CTCF and thus allows 
the exclusive expression of H19 from the maternal allele and 
Igf2 from the paternal allele (Mann et al., 2000; Wolffe, 2000; 
West et al., 2002; Bartolomei, 2003; Engel and Bartolomei, 
2003). We examined H19 imprinting in our cell lines and found 



that HI 9 was biallelically expressed in MCF7 cells transfected 
with CTCF-Sssl (Fig. 7 A) and with CTCF-EGFP (Fig. 7 B) but 
remained monoallelically expressed in human fibroblast HBF1 
(Fig. 7 C) and WSF7 (Fig. 7 D) cells. Thus, imprinting of the 
human IGF2 and H19 can be uncoupled in a manner similar to 
that as observed in human tumors (Feinberg, 1993; Cui et al., 
2002; Ulaner et al., 2003; Chen et al., 2006). 

Sssl is a CpG DNA methylase. Using sodium bisulfite se- 
quencing, we did not observe any alteration of DNA methylation 
at the ICR DNA (Fig. SI). Thus, as reported in human tumors, bi- 
allelic IGF2 expression induced by the decoy protein is not nec- 
essarily accompanied by altered DNA methylation in the ICR. 

Aberrant: IGFS and HIS genomic imprinting 
affects gene expression 

Loss of genomic imprinting is one of the factors that contributes 
to the increased expression of IGF2 in some tumors. We thus 
were interested in whether loss of IGF2 imprinting induced by 
decoy CTCFs would alter IGF2 expression. We quantitated IGF2 
mRNA transcripts by real-time PCR (Fig. 8). IGF2 is monoalleli- 
cally expressed in the breast cancer cell line MCF7 and in normal 
breast tissue. There was increased expression of IGF2 mRNA in 
the CTCF decoy-expressing cells, where IGF2 was biallelically 
expressed (Fig. 8, A and B). Similarly, loss of HI 9 imprinting also 
led to increased H19 abundance in MCF7 cells (Fig. 8 A). In nor- 
mal fibroblasts, where H19 remained imprinted despite CTCF 
decoy expression, HI 9 gene expression did not increase. We also 
observed enhanced cell proliferation in human fibroblast cells 
that show biallelic expression of IGF2 (Fig. S2). There were no 
obvious changes in the cell morphology of the infected cells. 
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Figure 5. Allele-specific alteration of local chromatin structure. (A) Alterations of local chromatin structure in decoy ZF-Sssl -expressing MCF7 cells. 
(B) Biallelic alteration of local chromatin structure near IGF2 promoters. After PCR, two parental alleles were distinguished by Bfal polymorphic restric- 
tion enzyme. (C) Monoallelic alteration of local chromatin structure in the IGF2 DMR2 region. Two parental alleles were separated by Nallll polymorphic 
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Discussion 

In our previous study (Li et al., 2008), we demonstrated that 
CTCF and Suzl2 are coprecipitated from nuclear extracts and 
interact with each other in a two-hybrid system. RNAi knock- 
down of Suzl2 also leads to reactivation of the maternal Igf2 



allele and biallelic Igf2 expression. Our data in this study pro- 
vide further evidence that CTCF is an active participant in con- 
trolling allelic expression of IGF2 (Fig. 9). CTCF binds to 
regions around each of the IGF2 promoters as well as to the un- 
methylated ICR on the maternal allele. CTCF molecules from 
these distant loci can dimerize, leading to the formation and 
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Figure 6. Loss of IGF2 imprinting in decoy 
CTCF-expressing cells. (A] ZF-Sssl -expressing 
breast cancer MCF7 cells. (B) ZF-Sss 1 -expressing 
human skin fibroblast HBF1. (C) ZF-EGFP- 
expressing MCF7. (D) ZF-EGFP-expressing human 
skin fibroblast WSF7 cells. Allelic expression 
of IGF2 was determined by polymorphic restric- 
tion enzymes Alul (A, C, and D] and Apal (B). 
gDNA, genomic DNA as the positive control; 
mock, empty lentiviral vector that does not contain 
the decoy constructs; control, untreated cells. 
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reinforcement of intrachromosomal loops. By orchestrating 
chromatin loop structures, CTCF serves as a DNA-binding pro- 
tein scaffold to recruit and bind polycomb repressive complexes 
and deliver the parent-specific H3K27 methylation signal to the 
remote IGF2 promoters, leading to suppression of maternal 
IGF2 expression. Through the direct interaction of SUZ12 with 
the DNA-bound CTCF (Li et al., 2008), the PRC2 complex is 
recruited specifically to the maternal promoters, where it meth- 
ylates H3-K27, leading to the formation of a repressive chromatin 
state on the maternal allele. CTCF cannot bind to the methyl- 
ated paternal ICR, and, thus, there is no scaffold to secure PRC2 
to that site (Bell and Felsenfeld, 2000; Hark et al., 2000). In the 
absence of CTCF-PRC2 complex binding, a more relaxed chro- 
matin structure is achieved, the paternal IGF2 promoters are 
able to access the downstream enhancers, and the gene is tran- 
scribed in an allele-specific manner. 

CTCF plays a critical role in regulating the allelic expres- 
sion of IGF2. By binding to its target sites at the promoter region 
and at the distant ICR, CTCF complexes can tether a long-range 
chromatin loop that delivers the parent-specific methylation sig- 
nal in the ICR to the remote IGF2 promoters that do not carry 
any imprinting marks. Interruption of any components in this 
pathway, including aberrant ICR DNA methylation, decreased 
expression of PRC2 proteins, mutation of the ICR, and altered 
H3-K27 methylation, will cause a loss of IGF2 imprinting. 



Because CTCF regulates many genes by binding to promoters, 
enhancers, and silencers, it would be of great interest to explore 
whether CTCF-mediated recruitment of PRC2 proteins is a nec- 
essary common mechanism in regulating these targeted genes. 

In mouse, Igf2 and H 19 are tightly coordinated and recipro- 
cally imprinted (Bartolomei et al., 1993) through the CTCF insu- 
lating effect on the unmethylated maternal CTCF DMR (Bell and 
Felsenfeld, 2000; Hark et al., 2000). However, allelic methyla- 
tion of the paternal ICR abrogates the binding of CTCF and thus 
allows the exclusive expression of H19 from the maternal allele 
and Igf2 from the paternal allele (Mann et al., 2000; Wolffe, 
2000; West et al., 2002; Bartolomei, 2003; Engel and Bartolomei, 
2003). Deletion or mutation of the CTCF DMR relaxes the 
normally silent maternal Igf2 allele (Thorvaldsen et al., 1998; 
Srivastava et al., 2000; Szabo et al., 2004). In this study, however, 
we found that monoallelically expressed H19 and biallelically 
expressed IGF2 coexisted in decoy CTCF-transfected human 
fibroblast HBF1 and WSF7 cells (Figs. 6 and 7). Thus, in human 
1GF2/H19, the regulation of these two reciprocally imprinted 
neighboring genes can be uncoupled. This uncoupling has previ- 
ously been reported in a variety of human tumors (Feinberg, 
1993; Cui et al., 2002; Ulaner et al., 2003; Chen et al., 2006). 

It should be emphasized that all genetic manipulations, 
including ICR deletions and/or point mutations in the ICR, 
designed to prevent CTCF binding, are affected before Igf2 
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Figure 7. Decoy proteins induce biallelic 
expression of HI 9 in MCF7 but not in HBF1 
or WSF7 cells. (A-D) The two parental alleles 
of HI 9 were distinguished by polymorphic 
restriction enzyme Rsal. Note the biallelic 
expression of HI 9 in MCF7 clones but mono- 
allelic expression of H19 in HBFl and WSF7 
clones. (B and D] Allelic expression of H? 9 in 
MCF7 (B) and fibroblast WSF7 cells (D) that 
are stably transfected with ZF-EGFP decoy pro- 
tein construct. Control, untreated cells; mock, 
virus carrying the empty vector. 
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imprinting is set up in early embryos. However, in human tumors, 
biallelic IGF2 expression occurs as a result of aberrant regula- 
tion of the imprinting maintenance mechanism after imprinting 
has already been established. In addition, all ICR knockout or 
mutation experiments have been conducted in mouse models, 
and the regulation of IGF2/H19 imprinting may depend on 
species-specific factors. Thus, the CTCF decoy approach pro- 
vides an alternative strategy to explore possible mechanisms 
underlying loss of IGF2 imprinting in human cells. 

The critical involvement of CTCF in maintaining normal 
monoallelic expression of IGF2 suggests that the CTCF reg- 
ulatory pathway may be dysfunctional in human tumors in 
which IGF2 is biallelically expressed (Thorvaldsen et al., 1998; 
Schoenherr et al., 2003; Pant et al., 2004). A maternally trans- 
mitted microdeletion of two CTCF binding sites in the ICR results 
in biallelic IGF2 expression and H19 silencing in Beckwith- 
Wiedemann syndrome (Sparago et al., 2004). Using nuclear 
transfer, we previously showed that loss of IGF2 imprinting in 
human tumor cells was reversed by the imprinting machinery in 
normal fibroblast cytoplasm, leading to monoallelic expression 
of IGF2 in the reconstructed tumor cybrids or hybrids (Chen et al., 
2006). Moreover, this epigenetic resetting of IGF2 imprinting 
in tumors was not accompanied by any changes in DNA 
methylation at any of the DMRs (DMR 0 , ICR, and K V DMR!). 



Recently, we also showed that the CTCF-dependent intrachro- 
mosomal loop was lost in human fibroblasts in which IGF2 is 
biallelically expressed after cycloheximide treatment, despite 
the fact that DNA methylation in the ICR was not altered (un- 
published data). 

Similarly, we did not detect any alteration of DNA meth- 
ylation in the ICR in CTCF-Sssl-transfected MCF7 and HBFl 
cells (Fig. SI). Sssl is a CpG dinucleotide methylase cloned 
from Spiroplasma monobiae strain MQ1 (Renbaum et al., 1990). 
It de novo methylates DNA exclusively at CpG sites in vivo and 
in vitro without sequence preference. In a previous study by 
Xu and Bestor (1997), Sssl methylase, when fused to the CT 
domain of the Zif268 ZF domain or a Zif268 derivative that 
binds to the p53 binding sites, was able to induce de novo DNA 
methylation specifically at the target sites. In a parallel study 
with mouse fibroblasts, we have found that our CTCF-Sssl con- 
struct is able to methylate CTCF binding sites at the K v DMRi in 
the Kcnql imprinting locus (Fitzpatrick et al., 2007), which is 
^700 K from mouse Igf2 (unpublished data). Thus, the gene 
locus and its local chromatin structure may be critical factors 
that determine the ability of CTCF-Sssl to methylate nearby 
DNA. Alternatively, the methylase, when fused with the ZF do- 
main, may become inactive in this locus by an unknown protec- 
tion mechanism. Collectively, these findings suggest that, unlike 
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Figure 8. Real-time PCR quantitation of IGF2 and H19 mRNA transcripts. 

IGF2, HI 9, and the housekeeping f}-actin genes were coamplified from 
each cDNA synthesized from normal tissue (human breast and skin), con- 
trol, and decoy CTCF-expressing cells in MCF7 (A), HBF1, and WSF7 
cells (B). IGF2 and HI 9 were quantitated in duplicate for each sample 
and were determined by a ACT and A— ACT calculation with reference 
to human [3-actin gene control. IGF2 and HI 9 expression was normal- 
ized and presented as the number by using the IGF2 and HI 9 level in 
controls (white) as 1 (tissue, n = 6; control, n = 6; treatment, n = 6). 
*, P < 0.05 relative to IGF2 and H / 9 expression in the MCF7 control (A) 
and fibroblast control (B). All data are presented as means ± SD of three 
independent experiments. 



the mouse Igf2, loss of IGF2 imprinting in tumors may not nec- 
essarily be accompanied by changes in DNA methylation in 
known ICRs, but rather may be related to dysfunction of any of 
the components of the regulatory network, including the intra- 
chromosomal loop, the ICR, CTCF, or the PRC2 complex. In 
support of this hypothesis, it is known that human tumors often 
show a lack of correlation between DNA methylation and IGF2 
imprinting status (Moore et al., 1997; Sullivan et al., 1999; Cui 
et al., 2002; Chen et al., 2006). 

It also should be noted that CTCF regulates many genes 
by binding to promoters, enhancers, and silencers. The CTCF- 
PRC2 complex may play a role in the regulation of many other 
genes that are regulated by CTCF, such as c-myc, p-globin, am- 
yloid (3-protein precursor, and X-inactivated genes. Furthermore, 
CTCF has been described as the master weaver of the genome, 
facilitating intrachromosomal as well as interchromosomal inter- 
actions (Phillips and Corces, 2009). For example, CTCF medi- 
ates an interchromosomal colocalization between the lgP.IH19 
ICR on mouse chromosome 7 and Wsbl/Nfl on mouse chromo- 
some 11 (Ling et al., 2006). Thus, it would be of great interest 
to learn whether decoy CTCF proteins alter interchromosomal 



associations and, in addition, the expression of these CTCF- 
regulated genes. Furthermore, the model as described in Fig. 9 
does not explain the involvement of the H19 enhancer. Further 
studies are needed to delineate how the enhancer is coordinated 
with the CTCF-Suzl2-RPC2 complex in regulating allelic 
expression of IGF2. 

Materials and methods 

GST-CTCF constructs 

To construct recombinant GST fusion proteins, the various CTCF cDNA 
fragments (full-length CTCF, CTCF-NT, CTCF-ZF, CTCF-ZFCT, and CTCF-CT) 
were generated from template pOBT7-CTCF vector by PCR amplification 
containing BamHI-Xhol restriction enzyme (Table SI ) using pfu polymerase 
(Agilent Technologies) and were cloned into pGEX-4T-2 vector (Invitrogen). 
The PET-24b-SUZl 2 construct was generated by cloning the full-length 
SUZ1 2 cDNA with Sall-Notl site digested from pCMV-SPORT6 (Invitrogen) 
into pET-24b vector. 

CTCF fusion protein construction 

The Sssl DNA methyltransferase DNA was amplified from the genomic 
DNA of S. monobiae strain MQ1 (33825; American Type Culture Collec- 
tion). To enhance translation in mammalian cells, four TGA codons that en- 
code tryptophan residing in the S. monobiae were converted into TGG by 
PCR ligation. The cDNA fragment encoding the CTCF ZF domain was gen- 
erated from the pOBT7-CTCF vector by PCR amplification. These two DNA 
fragments were then linked by SV40 NLS and a short linker sequence to 
produce the ZF-Sssl construct, which was then cloned into the Nhel-BamH 
sites in pCDH-CMV-MCS-EF 1 -Puro lentivirus vector (SBI). EGFP was ampli- 
fied from EGFP-N1 vector (Takara Bio Inc.) and was used to replace Sssl 
to generate pNl-CTCF ZF-EGFP construct (Table S2). 

Recombinant His-tagged constructs 

To construct recombinant His-tagged fusion proteins, the full-length CTCF 
and decoy CTCF (ZF-Sssl and ZF-EGFP) fragments with Nhel and Notl 
sites were generated from template pOBT7-CTCF vector pCDH-ZF-Sssl 
and pCDH-ZF-EGFP by PCR, respectively. These fragments were then 
cloned into pET-24b vector by framing with His to generate His-tagged 
fusion proteins (Table S3). 

Protein interaction in vitro 

All recombinant GST fusion, His-tagged, and SUZ12 proteins were ex- 
pressed in KRX Escherichia coli strain according to the manufacturer's pro- 
tocol (Promega). The in vitro protein interaction assay was performed as 
described previously (ti et al., 2008). In brief, purified GST fusion and His- 
tagged proteins (10 ug) were incubated with 50 pi of cell-free supernatant 
containing recombinant SUZ12 in 200 pi of binding buffer (20 mM Tris- 
HCI, pH 7.5, 100 mM NaCI, 2.5 ng/ml BSA, 10 mM EDTA, 0.1% Triton 
X-l 00, 1 mM DTT, and 1 0% glycerol). After incubation for 3 h at 4°C, the 
glutathione particles (BD) or His tag isolated beads (Invitrogen) containing 
the protein complex were pulled down and washed three times. Binding 
proteins were eluted for Western blotting using anti-SUZl 2, anti-CTCF, and 
anti-GFP antibodies (Abeam). 

Lentiviral transduction 

The lentiviruses were generated in 293SF-PacLV cells according to the pro- 
tocol provided by the manufacturer. The viral supernatants were filtered 
through a 0.45-mm filter, concentrated by the PEG-IT kit (SBI), and aliquoted 
in a — 80°C freezer for long-term storage. Human fibroblasts and tumor 
cells were seeded at 1 .0 x 1 0 5 cells per well of a 6-well plate 24 h before 
transduction. The medium was replaced with virus-containing supernatant 
containing 5 mg/ml polybrene (Sigma-Aldrich) and incubated overnight. 
24 h after transduction, the virus-containing medium was replaced with 
fresh medium for further culture. 4 d later, the infected cells were harvested 
by trypsinization and replated on a new 100-mm dish. When cell conflu- 
ence reached 20-30%, the culture medium was replaced by fresh medium 
containing 1 pg/ml puromycin for colony selection. The medium was 
changed every 3-4 d. After selection, colonies were chosen and expanded 
for further analyses. 

Chromosome conformation capture (3C) 

The 3C assay was performed as described previously (Dekker et al., 
2002; Li et al., 2008). In brief, 1 .0 x 10 7 cells were cross-linked with 2% 
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Figure 9. Model of the IGF2/H19 imprinting region. Aberrant IGF2 imprinting induced by the decoy CTCF. In cells that maintain normal IGF2 imprinting, 
CTCF orchestrates an intrachromosomal loop by binding to both the unmethylated ICR and to the IGF2 promoters on the maternal allele. CTCF then dimer- 
izes, ' 
histon 

abrogating the IGF2 promoters/ICR i.... 

resulting in the reactivation of the normally suppressed maternal allele. 



formaldehyde and lysed with cell lysis buffer (10 mM Tris, pH 8.0, 1 0 mM 
NaCI, 0.2% NP-40, and protease inhibitors). Nuclei were collected, sus- 
pended in 1 x restriction enzyme buffer in the presence of 0.3% SDS, and 
incubated at 37°C for 1 h. Triton X-l 00 was then added to a final concen- 
tration of 1 .8% to sequester the SDS. An aliquot of nuclei (2 x 1 0 6 ) was di- 
gested with 800 U of restriction enzyme EcoRI at 37°C overnight. After 
stopping the reaction by adding 1 .6% SDS and incubating the mixture at 
65°C for 20 min, chromatin DNA was diluted with ligation reaction buffer 
(New England Biolabs, Inc.), and 2 pg DNA was ligated with 4,000 U T4 
DNA ligase (New England Biolabs, Inc.) at 16°C for 4 h (final DNA con- 
centration of 2.5 ug/ml). After treatment with 10 mg/ml proteinase K at 
65°C overnight to reverse cross-links and with 0.4 pg/ml RNase A for 30 min 
at 37°C, DNA was extracted with phenol-chloroform, ethanol precipitated, 
and used for PCR amplification of the ligated DNA products. PCR primers 
used in this study are listed in Table S4. 

CHIP 

ChIP assays were performed as described previously (Li et al., 2008). 
In brief, five million cells were fixed with 1 % formaldehyde and sonicated for 
1 80 s (1 0 s on and 1 0 s off) on ice with a Branson sonicator with a 2-mm 
microtip at 40% output control and 90% duty cycle settings. The 1 ml of 
sonicated chromatin was clarified by centrifugation, aliquoted, and snap 
frozen in liquid nitrogen. To perform ChIP, 150 pi of sonicated chromatin 
was diluted 1 0-fold and purified with 2-5 pi of specific antiserum and 60 pi 
protein G-agarose. Antibodies to GFP, CTCF, SUZ12, and dimethyl-H3- 
K27 (lysine 27 of histone H3) were obtained from Abeam. DNA that was 
released from the bound chromatin after cross-linking reversal and pro- 
teinase K treatment was precipitated and diluted in 1 00 pi of low TE buffer 
(1 mM Tris and 0. 1 mM EDTA). PCR reactions were performed under liquid 
wax in a reaction containing 1 pi ChIP (or input) DNA, 0.5 pM of appro- 
priate primer pairs, 50 pM deoxynucleotide triphosphate, and 0.2 U 
Klen-Taql (Ab Peptides). Standard PCR conditions were 95°C for 2 min 
followed by 35 cycles of 95°C for 1 5 s, 65°C for 30 s of annealing, and 
72°C for 30 s of extension (Table S5). The PCR products were separated 



on a 5% polyacrylamide-urea gel and quantified by a Phosphorlmager 
(Molecular Dynamics). 

Examination of IGF2 and HI 9 imprinting 

Total RNA was extracted from tissues by TRI-REAGENT (Sigma-Aldrich) ac- 
cording to the manufacturer's guide, and cDNA was synthesized with RNA 
reverse transcription. Genomic imprinting of IGF2 was examined by PCR 
in cDNA samples as previously described using primers specific for two 
polymorphic restriction enzymes (Apal and Alul) in the last exon of human 
IGF2. After PCR, two parental alleles were distinguished by the digestion 
of polymorphic restriction enzymes Alul and Apal and were separated on 
5% polyacrylamide gel (Hu etal., 1996, 1997; Chen etal., 2006). Note 
that in CTCF decoy-expressed cells, the imprinted A allele became acti- 
vated, leading to biallelic expression of IGF2. Allelic expression of HI 9 was 
assessed by polymorphic restriction enzyme Rsal. PCR primers used to 
measure allelic expression of IGF2 and H19 are listed in Table S6. 

Real-time RT-PCR 

Quantitative real-time RT-PCR amplification was performed using Quanti- 
Tect SYBR green (QIAGEN) as previously described (Chen et al., 2006). 
Specifically, total RNA was extracted by TRIZOL reagent (Invitrogen), and 
cDNA was synthesized with RNA reverse transcription. The threshold cycle 
(Ct) values of IGF2 and H / 9 were quantitated by quantitative PCR in dupli- 
cate using a sequence detector (ABI Prism 7900HT; Applied Biosystems) 
according to the manufacturer's protocol and was normalized over the Ct 
of the /3-actin control (Table S7). 

DNA methylation analysis 

Total nucleic acids extracted from fibroblast and tumor cells were used to 
examine DNA methylation patterns. As previously described (Ulaner et al., 
2003), total nucleic acids were treated with sodium bisulfite, and DNA in 
the IGF2/H19 DMR was amplified with DNA methylation-specific primers 
designed for CTCF binding sites (Table S8). After PCR, methylated and un- 
methylated DNAs were separated by Mlul. To examine the status of DNA 
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methylation in every CpG site in the key sixth CTCF binding region, the am- 
plified PCR DNAs were cloned into TA vector (Invitrogen) and were se- 
quenced using the vector primer. 

Western blot analysis 

Protein interaction was determined by Western blotting as previously de- 
scribed. Pull-down protein complexes were separated by SDS-PAGE in 1 2% 
(wt/vol] polyacrylamide gels and transferred to polyvinylidene fluoride mem- 
branes. Membranes were used in immunoblotting with anti-SUZ12, anti- 
CTCF, and anti-GFP antibody (Abeam). 

Survival assay 

Cells were seeded at 5,000 cells per well in flat-bottomed 96-well plates. At 
the end of the incubation time, 20 pi of 5-mg/ml MTT (3-(4,5-dimethylthiazol- 
2-yl]-2,5-diphenyltetrazolium bromide; Sigma-Aldrich) in PBS was added to 
each well. After 4 h, media were discarded, and cells were lysed with 1 00 ul 
DMSO. Cells were incubated for a further 30 min at 37°C with gentle shak- 
ing. The optical density was determined with a microplate reader at 570 nm. 
Absorbance values were normalized to the values of control cells at day 1 
to calculate the percentage of survival. 

Statistical analysis 

All experiments were performed in triplicate, and the data were expressed 
as mean ± SD. The comparative Ct method was applied in the quantitative 
real-time RT-PCR assay according to the A— ACt method. The data were ana- 
lyzed with f test or by one-way analysis of variance, and results were con- 
sidered statistically significant at P < 0.05. 

Online supplemental material 

Fig. SI shows DNA methylation at the CTCF ICR in the IGF2/H19 im- 
printing domain. Fig. S2 shows the growth characteristics of decoy CTCF- 
expressing cells. Table SI shows primers used for construction of various 
GST-CTCF fusion proteins. Table S2 shows primers used for construction 
of decoy CTCF vectors. Table S3 shows primers used for construction of 
His-tagged fusion proteins. Table S4 shows primers used for 3C at the 
IGF2 locus. Table S5 shows primers used for ChIP assays. Table S6 
shows primers used for allelic expression of IGF2 and HI 9 imprinted 
genes. Table S7 shows primers used for real-time PCR. Table S8 shows 
primers used for DNA methylation of the sixth CTCF binding site in the 
ICR of the IGF2/H19 locus. Online supplemental material is available at 
http://www.jcb.org/cgi/content/full/icb.201 1 01 02 1 /DC 1 . 
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